Effects of Easy Hybrid Parallelization with CUDA for Numerical-Atomic-Orbital Density Functional Theory Calculation

Authors

  • Jae-Hyeon Parq
  • Erik Sevre
  • Sang-Mook Lee
Abstract

We modified an MPI-friendly density functional theory (DFT) source code within a hybrid parallelization scheme that includes CUDA. Our objective is to find out how simple conversions within the hybrid parallelization, using mid-range GPUs, affect a DFT code not originally suited to CUDA. We established several rules of hybrid parallelization for numerical-atomic-orbital (NAO) DFT codes. The test was performed on a magnetite material system with the OpenMX code, on a hardware system containing 2 Xeon E5606 CPUs and 2 Quadro 4000 GPUs. The 3-way hybrid routines obtained a speedup of 7.55, while the 2-way hybrid routines obtained a speedup of 10.94. GPUs with CUDA complement the efficiency of OpenMP and compensate for the CPUs' excessive competition within MPI.

arXiv:1402.4247v1 [cs.DC] 18 Feb 2014

Similar resources

Effects of Easy Hybrid Parallelization with CUDA for OpenMX

An MPI-friendly density functional theory (DFT) source code was modified within a hybrid parallelization scheme that includes CUDA. The objective is to find out how simple conversions within the hybrid parallelization, using mid-range GPUs, affect a DFT code not originally suited to CUDA. Several rules of hybrid parallelization for numerical-atomic-orbital (NAO) DFT codes were established. The test was performed on...

Electronic Structure Investigation of Octahedral Complex and Nano ring by NBO Analysis: An EPR Study

To calculate the non-bonded interaction of the [CoCl6]3- complex embedded in a nano ring, we focus on the single-wall boron-nitride B18N18 nano ring. Thus, the geometry of the B18N18 nano ring has been optimized by the B3LYP method with the EPR-II (electron paramagnetic resonance) basis set, and the geometry of the [CoCl6]3- complex has been optimized by the B3LYP method with Ahlrichs' VTZ basis set and Stuttgart RSC 1...

Adsorption of Vitamin C on a Fullerene Surface: DFT Studies

Density functional theory (DFT) calculations were performed to investigate the adsorption of vitamin C (Vit) on the surface of a fullerene structure (Ful) in gaseous and water-solvated systems. Two models of Vit, OVit and MVit, were created: OVit is based on the original structure of Vit, and MVit on methylation of all its hydroxyl groups. All singular and hybrid structures were optimized and the...

An Ab initio and chemical shielding tensors calculations for Nucleotide 5’-Monophosphates in the Gas phase

Structural and magnetic properties of purine and pyrimidine nucleotides (CMP, UMP, dTMP, AMP, GMP, IMP) were studied at different levels of ab initio molecular orbital theory. These calculations were performed at the Hartree-Fock level and with the density functional B3LYP method. Geometries were fully optimized under Cs symmetry restrictions. The standard 6-31G** basis set which includes polari...

Journal title:
  • CoRR

Volume abs/1402.4247

Publication date 2014